Severe acute respiratory syndrome (SARS) is a deadly form of pneumonia caused by a novel coronavirus, a 
viral family responsible for mild respiratory tract infections in a wide variety of animals including humans, 
pigs, cows, mice, cats, and birds. Analyses to date have been unable to identify the precise origin of the SARS 
coronavirus. We used Bayesian, neighbor-joining, and split decomposition phylogenetic techniques on the 
SARS virus replicase, surface spike, matrix, and nucleocapsid proteins to reveal the evolutionary origin of this 
recently emerging infectious agent. The analyses support a mammalian-like origin for the replicase protein, an 
avian-like origin for the matrix and nucleocapsid proteins, and a mammalian-avian mosaic origin for the 
host-determining spike protein. A bootscan recombination analysis of the spike gene revealed high nucleotide 
identity between the SARS virus and a feline infectious peritonitis virus throughout the gene, except for a 200- 
base-pair region of high identity to an avian sequence. These data support the phylogenetic analyses and 
suggest a possible past recombination event between mammalian-like and avian-like parent viruses. This event 
occurred near a region that has been implicated as the human receptor binding site and may have been 
directly responsible for the host switch of the SARS coronavirus from animals to humans. 


Severe acute respiratory syndrome (SARS) is a potentially 
fatal atypical pneumonia that arose in Guangdong Province of 
the People’s Republic of China in November 2002. Within 6 
months SARS spread to over 30 countries and killed over 700 
people (27, 37). This outbreak has had a profound impact on 
public health and economies worldwide and reinforced the 
danger of emerging infectious diseases in densely populated 
societies. 

SARS is believed to be a viral zoonotic disease (9). Viral 
zoonoses are responsible for most of the emerging infectious 
diseases (8, 21) and often originate due to genetic exchange 
between viruses with different host specificities. Shifts in the 
host range of the influenza virus have been shown to be due to 
the genetic exchange of the host-determining neuraminidase 
and hemagglutinin genes (17, 36). One such genetic exchange 
event is believed to have been responsible for the 1918 Spanish 
influenza pandemic which killed over 20 million people. There 
is also some evidence that such genetic reassortment may have 
occurred in the virus that causes SARS. 

SARS is caused by a novel coronavirus (6, 19, 24, 40). Coro- 
naviruses are enveloped, positive-strand RNA viruses that 
cause localized but mild respiratory tract infections in a wide 
variety of animals including pigs, cows, mice, horses, and birds 
(35). In humans, coronaviruses normally cause symptoms as- 
sociated with the common cold (12). The SARS coronavirus is 
similar in composition and genetic organization to previously 
described coronaviruses. Its genome includes genes encoding a 
replicative polyprotein (PP1ab), surface spike glycoprotein (S), 
envelope, matrix (M), and nucleocapsid (N) proteins (5, 22). 
The SARS coronavirus also encodes an additional nine predicted 
open reading frames whose precise functions are unknown 
(22). The S protein is of particular importance as it is a key 
virulence and host range determinant (3, 4, 14, 25, 26, 31, 34). 
The region between amino acids (aa) 417 and 547 of the 
human coronavirus 229E S protein has been shown to be the 
binding region for the human aminopeptidase N receptor (3, 
38). Targeted recombination between feline and mouse S pro- 
teins has been shown to enable feline-specific coronaviruses to 
infect mice (10). Comparative analyses of the SARS corona- 
virus genomes revealed two amino acid substitutions within the 
S protein, signaling possible positive selection due to immuno- 
logical pressures (29). 

The origin of the SARS coronavirus has been the subject of 
intense speculation. Enserink (7) recently reported that the 
SARS virus could be isolated from masked palm civets avail- 
able in the food markets of southern China. The civet family 
(Viverridae) is in the suborder Fissipedia and is a close relative 
of the cat family (Felidae) (32). Despite this hopeful lead, 
phylogenetic analyses have been unable to definitively identify 
the evolutionary origin of this virus. Previous studies have 
reported that the SARS coronavirus is phylogenetically distinct 
and not a composite of other coronaviruses (13, 22, 28). Rota 
et al. (28) were unable to identify any partitions in the pattern 
of polymorphism along the SARS genome consistent with re- 
combination (13). There are also claims that the SARS virus is 
not a host range mutant of any previously described coronavi- 
ruses due to its low sequence identity to known coronaviruses 
(13, 29). 

To further investigate the origins of the SARS coronavirus, 
we performed a detailed phylogenetic analysis of the PP1ab, S, 
M, and N proteins with Bayesian, neighbor-joining, and split 
decomposition methods. Additionally, we examined the gene 
that encodes the host-determining S protein for evidence of 
past recombination events that may have resulted in a new 
ability to infect human hosts. Our results indicate that the 
SARS virus is in fact a mosaic of mammalian and avian-like 
viruses and that recombination between the parent viruses may 
have occurred in the host-determining S gene. 


MATERIALS AND METHODS 


Sequence retrieval. Homologous sequences for the PP1ab, S, M, and N proteins 
of the SARS coronavirus strain Urbani (AY278741) were acquired by 
BLASTP against the GenBank nr database. Redundant sequences or sequences 
sharing ≥95% protein similarity (as determined by a pairwise BLAST search) 
were discarded. An attempt was made to include sequences from all three 
taxonomically described groups of coronaviruses, and whenever possible, the 
same representative taxa (accessions) were used for all gene genealogies. 
Noncoronavirus sequences that were similar to the SARS virus were identified for the 
SARS PP1ab polyprotein by performing BLASTP with the following Entrez 
limit: “polyprotein OR PP1ab OR replicase NOT coronavirus.” This search 
resulted in the identification of two domains of the SARS PP1ab polyprotein (aa 
4751 to 5024 and aa 5276 to 5851) that show significant similarity (98.6 bits; E = 
6e-18; 37% similarity) to the gill-associated okavirus from the Roniviridae, a 
sister family to the Coronaviridae. Imposing a similar Entrez limit on BLASTP 
and BLASTN searches of the S, M, and N proteins did not yield any significant 
hits to viruses in other families. 
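
The redundancy filter described above can be illustrated with a minimal Python sketch. It is not the procedure used in the study, which relied on pairwise BLAST similarity; here the similarity measure is swapped for the fraction of identical positions between sequences that are assumed to be already aligned to equal length. The function names and the toy sequences are hypothetical.

```python
def fraction_identical(a: str, b: str) -> float:
    """Fraction of aligned positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def drop_redundant(seqs: dict[str, str], threshold: float = 0.95) -> list[str]:
    """Greedy filter: keep a sequence only if it scores below the threshold
    against every sequence that has already been kept."""
    kept: list[str] = []
    for name, seq in seqs.items():
        if all(fraction_identical(seq, seqs[k]) < threshold for k in kept):
            kept.append(name)
    return kept


if __name__ == "__main__":
    # Toy data (assumption): "b" duplicates "a"; "c" differs at 2 of 20 sites.
    toy = {
        "a": "MKTLLVAGGAMKTLLVAGGA",
        "b": "MKTLLVAGGAMKTLLVAGGA",
        "c": "MKTLLVAGGAMKTLLVAGCC",
    }
    print(drop_redundant(toy))  # ['a', 'c']
```

Because the filter is greedy, the retained set depends on input order; the first member of each redundant cluster is the one that survives.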

Phylogenetic analyses. Amino acid alignments used in the phylogenetic analyses 
were made with ClustalX version 1.83 (33) with default parameters. An 
exception to this protocol was the M protein alignment, where a multiple alignment 
gap opening penalty of 5 was used. Alignments were imported into 
Genedoc version 2.5 (http://www.psc.edu/biomed/genedoc) where gaps and ambiguously 
aligned regions were deleted. The two domains of the gill-associated 
okavirus, totaling 880 aa, were concatenated and edited in a similar manner. The 
multiple sequence alignments and phylogenetic trees are available at 
http://www.botany.utoronto.ca/ResearchLabs/guttmanLab. 
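
As a rough illustration of the gap-editing step, the following Python sketch removes every alignment column that contains a gap in at least one sequence. It covers only the mechanical part of the editing; the removal of ambiguously aligned regions was a judgment made in Genedoc and is not reproduced here. The function name and the toy alignment are hypothetical.

```python
def strip_gap_columns(alignment: dict[str, str], gap_chars: str = "-.") -> dict[str, str]:
    """Return a copy of the alignment without any column that contains a gap."""
    rows = list(alignment.values())
    if not rows:
        return {}
    if len({len(r) for r in rows}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    # Keep a column only if no sequence has a gap character there.
    keep = [i for i in range(len(rows[0]))
            if all(r[i] not in gap_chars for r in rows)]
    return {name: "".join(seq[i] for i in keep)
            for name, seq in alignment.items()}


if __name__ == "__main__":
    toy = {"x": "MK-TL", "y": "MKATL", "z": "M-ATL"}
    print(strip_gap_columns(toy))  # {'x': 'MTL', 'y': 'MTL', 'z': 'MTL'}
```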

Phylogenetic analyses were made on 840, 162, and 345 aa of the S, M, and N 
proteins, respectively. The rooted PP1ab phylogeny was based on an 883-aa 
region to include the gill-associated okavirus, whereas the unrooted PP1ab phylogeny 
was generated using a 322-aa region for which there was a feline coronavirus 
sequence available. Phylogenetic trees were constructed only with the 
amino acid data due to the extremely high level of nucleotide diversity. 

Neighbor-joining trees were made with PHYLIP version 3.6a (J. Felsenstein, 
Phylogeny inference package, Department of Genetics, University of Washing- 
ton, Seattle) using SEQBOOT, PROTDIST, NEIGHBOR, and CONSENSE 
programs. Protein distance calculations were based on the Jones-Taylor-Thorn- 
ton protein weight matrix with 1,000 bootstrap (BS) replicates. All other vari- 
ables were set as default. Bayesian trees were constructed with MRBAYES 
version 3.04b (15) using Jones-Taylor-Thornton protein weight matrix with 
500,000 generations and a burn-in of 100. All other values were set as default. 
Consensus trees were generated by majority rule and viewed with TREEVIEW 
version 1.6.6 (23). The topologies of the Bayesian and neighbor-joining trees for 
each protein were identical; therefore, only the Bayesian trees are shown. 
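
The two bookkeeping steps around the tree-building programs, bootstrap resampling of alignment columns and majority-rule counting of clades, can be sketched in Python as follows. This is only an illustration of the idea behind SEQBOOT-style resampling and consensus support values, not a reimplementation of PHYLIP or MRBAYES; tree inference itself is left out, and each replicate tree is assumed to be supplied as a set of clades (frozensets of taxon names). All names and the toy data are hypothetical.

```python
import random
from collections import Counter


def resample_columns(alignment: dict[str, str], rng: random.Random) -> dict[str, str]:
    """Draw alignment columns with replacement to build one bootstrap replicate."""
    names = list(alignment)
    length = len(alignment[names[0]])
    cols = [rng.randrange(length) for _ in range(length)]
    return {n: "".join(alignment[n][i] for i in cols) for n in names}


def clade_support(replicate_clades: list[set[frozenset[str]]]) -> dict[frozenset[str], float]:
    """Percentage of replicate trees in which each clade appears."""
    counts = Counter(c for clades in replicate_clades for c in clades)
    total = len(replicate_clades)
    return {clade: 100.0 * k / total for clade, k in counts.items()}


def majority_rule(support: dict[frozenset[str], float], cutoff: float = 50.0) -> dict[frozenset[str], float]:
    """Keep only clades found in more than `cutoff` percent of replicates."""
    return {clade: s for clade, s in support.items() if s > cutoff}


if __name__ == "__main__":
    rep = resample_columns({"a": "ACGT", "b": "ACGA"}, random.Random(0))
    print(rep)
    # Toy replicate trees (assumption): three of four group SARS with avian.
    trees = [
        {frozenset({"SARS", "avian"})},
        {frozenset({"SARS", "avian"})},
        {frozenset({"SARS", "murine"})},
        {frozenset({"SARS", "avian"})},
    ]
    print(majority_rule(clade_support(trees)))
```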

Split decomposition analyses were performed with SplitsTree version 3.2 (16) 
using Hamming correction and are presented with equal edge length for clarity. 
Recombination effectively causes lineages to coalesce forward in time, resulting 
in trees that have reticulations or a network structure rather than the simple 
branching structure seen with most phylogenies. Split decomposition does not 
force evolutionary relationships to be strictly bifurcating or multibranching but 
permits network relationships. A split decomposition graph will look less tree- 
like and more net-like as the influence of recombination becomes more impor- 
tant in the history of a set of taxa. 
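
The difference between tree-like and net-like signal can be made concrete for four taxa. The Python sketch below computes Hamming distances and the quartet isolation index used in split decomposition (half the difference between the largest of the three pairwise distance sums and the sum for the split in question). One positive index means the quartet is tree-like; two positive indices mean conflicting signal, which is drawn as a box in a split graph. This is a sketch of the quartet case only, not of SplitsTree; function names and toy sequences are hypothetical, and raw mismatch counts are used instead of proportions.

```python
def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def quartet_isolation_indices(seqs: dict[str, str], a: str, b: str, c: str, d: str) -> dict:
    """Isolation index of each of the three possible splits of a quartet."""
    def dist(x: str, y: str) -> int:
        return hamming(seqs[x], seqs[y])

    sums = {
        ((a, b), (c, d)): dist(a, b) + dist(c, d),
        ((a, c), (b, d)): dist(a, c) + dist(b, d),
        ((a, d), (b, c)): dist(a, d) + dist(b, c),
    }
    largest = max(sums.values())
    return {split: (largest - s) / 2 for split, s in sums.items()}


if __name__ == "__main__":
    # Tree-like toy quartet: only the split ab|cd gets a positive index.
    tree_like = {"a": "AAAA", "b": "AAAT", "c": "TTTA", "d": "TTTA"}
    print(quartet_isolation_indices(tree_like, "a", "b", "c", "d"))
    # Conflicting toy quartet: ab|cd and ac|bd are both supported.
    mosaic = {"a": "AAAAA", "b": "AAATT", "c": "TTTAA", "d": "TTTTT"}
    print(quartet_isolation_indices(mosaic, "a", "b", "c", "d"))
```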

Bootscanning. A nucleotide alignment of the gene encoding the S protein was 
generated with ClustalX version 1.83 with a pairwise and multiple sequence gap 
opening penalty of 10 and a gap extension penalty of 2. The alignment was edited 
to remove all gaps and regions that could not be aligned. One representative 
sequence was used from both the avian infectious bronchitis coronavirus group 
and the group 1 coronaviruses (feline). Two representatives were chosen from 
the group 2 coronaviruses, representing the two major divisions in that group 
(murine and human). Bootscanning (30) was performed by using Simplot version 
3.2b (20) with the F84 model (window size, 200 bp; step, 20 bp) and with the 
SARS S nucleotide sequence as the query. Bootscanning uses a sliding window 
approach to detect recombination. At each window position a phylogenetic 
analysis is performed and the data are bootstrapped. The stronger the BS sup- 
port for the clade that includes the query sequence is, the higher the bootscan 
value will be. Recombination may change the branching order of phylogenies, 
thereby potentially moving the query sequence to another clade. Significant 
changes in phylogenetic relationships from window to window will result in 
changes in bootscan values and are indicative of probable recombination events. 
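
The sliding-window logic can be sketched in Python under a strong simplification. Simplot builds a bootstrapped phylogeny in each window; the sketch below instead swaps in a nearest-reference rule, scoring in each bootstrap replicate which reference sequence has the fewest mismatches to the query. It therefore illustrates the windowing and resampling only, not the F84 model or tree inference. The function name, the reference labels, and the toy sequences are hypothetical; ties go to the reference listed first.

```python
import random


def bootscan(query: str, refs: dict[str, str], window: int = 200, step: int = 20,
             replicates: int = 100, seed: int = 0) -> list[tuple[int, dict[str, float]]]:
    """For each window, return the percentage of bootstrap replicates in which
    each reference is the closest sequence to the query."""
    rng = random.Random(seed)
    results = []
    for start in range(0, len(query) - window + 1, step):
        wins = {name: 0 for name in refs}
        for _ in range(replicates):
            # Resample the window's columns with replacement.
            sample = rng.choices(range(start, start + window), k=window)
            dists = {name: sum(query[i] != seq[i] for i in sample)
                     for name, seq in refs.items()}
            wins[min(dists, key=dists.get)] += 1
        results.append((start, {n: 100.0 * w / replicates for n, w in wins.items()}))
    return results


if __name__ == "__main__":
    # Toy mosaic (assumption): first half matches one reference, second half the other.
    refs = {"feline_like": "A" * 40, "avian_like": "C" * 40}
    query = "A" * 20 + "C" * 20
    for start, pct in bootscan(query, refs, window=10, step=10, replicates=20):
        print(start, pct)
```

A switch in the best-supported reference between adjacent windows, as at position 20 in the toy query, is the pattern interpreted as a candidate recombination breakpoint.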

Bootscanning was performed on the nucleotide sequences since there is no 
application available to perform this analysis on amino acid sequences. Given the 
extensive nucleotide diversity of the sample, we could perform a bootscan anal- 
ysis only on a relatively small region of the SARS genome and had to restrict our 
analysis to a much smaller group of taxa. 

Relative rate test. Tajima’s relative rate test was performed with the MEGA2 
(18) program on the amino acid alignments. Homogeneity in the rate of evolu- 
tion among lineages supports the use of midpoint rooting for the phylogenetic 
trees. 
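
For readers unfamiliar with the test, a minimal Python sketch of Tajima's relative rate statistic follows. It counts sites at which only sequence 1 differs from the outgroup (m1) and sites at which only sequence 2 differs (m2), and compares them with a chi-square statistic on 1 degree of freedom, (m1 - m2)^2 / (m1 + m2). This is an illustration of the basic test, not of the MEGA2 implementation; the function name, the gap handling, and the toy sequences are assumptions.

```python
import math


def tajima_relative_rate(seq1: str, seq2: str, outgroup: str,
                         gap_chars: str = "-") -> tuple[int, int, float, float]:
    """Return (m1, m2, chi-square, p value) for two ingroup sequences and an outgroup."""
    if not (len(seq1) == len(seq2) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")
    m1 = m2 = 0
    for x, y, o in zip(seq1, seq2, outgroup):
        if x in gap_chars or y in gap_chars or o in gap_chars:
            continue  # skip columns with gaps
        if x != y and y == o:
            m1 += 1   # change unique to lineage 1
        elif x != y and x == o:
            m2 += 1   # change unique to lineage 2
    if m1 + m2 == 0:
        return m1, m2, 0.0, 1.0
    chi2 = (m1 - m2) ** 2 / (m1 + m2)
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p = math.erfc(math.sqrt(chi2 / 2))
    return m1, m2, chi2, p


if __name__ == "__main__":
    print(tajima_relative_rate("CCCCAAAA", "AAAAAAAA", "AAAAAAAA"))
    print(tajima_relative_rate("CCAAAA", "AACCAA", "AAAAAA"))
```

A large p value, as in the second toy call, is the outcome taken to indicate rate homogeneity between the two lineages.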


RESULTS AND DISCUSSION 


We examined the phylogenetic relationships of the SARS 
coronavirus and related viruses by using Bayesian, neighbor- 
joining, and split decomposition phylogenetic methods. Sepa- 
rate phylogenies were constructed from each protein sequence. 
With the exception of the PP1ab replicase polyprotein (discussed 
below), all trees are presented as unrooted dendrograms 
since there was no objective means to root the separate 
trees. 

The M and N genes, which encode two of the primary struc- 
tural proteins of the SARS virus, are on the 3’ end of the 
genome. Phylogenetic analyses show that both of these pro- 
teins diverge from the lineage leading to the avian infectious 
bronchitis coronaviruses. Support for an avian-like viral ances- 
tor for these proteins rests upon the branch separating SARS 
and avian viruses from all of the other coronaviruses (Fig. 1B). 
This branch has extremely strong support, with Bayesian pos- 
terior probability (PP) and neighbor-joining BS values for the 
avian-SARS M protein clade at 100 and 89%, respectively, and 
support for the avian-SARS N protein clade at 100 and 97%, 
respectively. Bayesian PP values are generally considered sig- 
nificant when they are greater than 95% (15), while BS scores 
are considered significant at 70% (11). No noncoronavirus 
sequences with any similarity to either the M or N proteins 
were found in GenBank; therefore, there is no objective way to 
root these trees. Nevertheless, the most exclusive monophy- 
letic group (with respect to accepted coronavirus taxonomy) 
that includes the two SARS proteins also includes the avian 
viruses and excludes any mammalian (groups 1 and 2) corona- 
viruses. These relationships are also supported by the midpoint 
rooting of the two trees. The M protein tree midpoint roots on 
the branch linking SARS to the group 1 (porcine, feline, and 
canine) and group 2 (bovine and murine) coronaviruses. The N 
protein tree midpoint roots on the branch leading to the group 
1 coronaviruses. The appropriateness of midpoint rooting is 
supported by a Tajima’s relative rate test, which finds no evo- 
lutionary rate heterogeneity among the major coronavirus 
groups (data not shown). Trees presented by Marra et al. (22) 
also support the grouping of the SARS virus M and N proteins 
with the avian clade. These results are also consistent with the 
characteristic avian-like elements of the SARS genome, in- 
cluding a stem-loop II-like motif at the 3’ end that is found 
only in avian coronaviruses (22). Marra et al. (22) suggest that 
this motif has been acquired through horizontal transfer, a 
theory supported by our study. 
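
Midpoint rooting, as invoked above, places the root halfway along the longest tip-to-tip path of an unrooted tree. The Python sketch below shows the idea on a tree stored as a nested dictionary of branch lengths. It is a generic illustration, not the procedure of any program used in this study; the function names, the node labels, and the toy branch lengths are hypothetical, and ties between equally long paths are resolved by input order.

```python
def path_between(tree: dict, start: str, goal: str) -> list[str]:
    """Return the unique path between two nodes of a tree (depth-first search)."""
    stack = [(start, [start])]
    seen = {start}
    while stack:
        node, path = stack.pop()
        if node == goal:
            return path
        for nbr in tree[node]:
            if nbr not in seen:
                seen.add(nbr)
                stack.append((nbr, path + [nbr]))
    raise ValueError("nodes are not connected")


def midpoint_root(tree: dict) -> tuple[str, str, float]:
    """Return (u, v, offset): the root lies on branch u-v, `offset` away from u."""
    tips = [n for n, nbrs in tree.items() if len(nbrs) == 1]
    best_len, best_path = -1.0, []
    for i, a in enumerate(tips):
        for b in tips[i + 1:]:
            path = path_between(tree, a, b)
            length = sum(tree[u][v] for u, v in zip(path, path[1:]))
            if length > best_len:
                best_len, best_path = length, path
    half, walked = best_len / 2, 0.0
    for u, v in zip(best_path, best_path[1:]):
        if walked + tree[u][v] >= half:
            return u, v, half - walked
        walked += tree[u][v]
    raise ValueError("tree has fewer than two tips")


if __name__ == "__main__":
    # Toy unrooted tree (assumption): three tips joined at internal node X.
    toy = {
        "A": {"X": 1.0},
        "B": {"X": 5.0},
        "C": {"X": 2.0},
        "X": {"A": 1.0, "B": 5.0, "C": 2.0},
    }
    print(midpoint_root(toy))  # ('B', 'X', 3.5)
```

The method is only as reliable as the assumption of roughly equal rates among lineages, which is why the relative rate test is reported alongside it.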

FIG. 1. (A) Organization of the SARS coronavirus genome. (B) Bayesian-inferred unrooted phylogenies of the SARS proteins, generated with 
MRBAYES version 3.04b using the Jones-Taylor-Thornton protein weight matrix. Tree sampling was performed for 500,000 generations, and a 
majority-rule consensus tree was built with a burn-in of 100. The PP1ab phylogeny was based on a conserved 322-aa region, corresponding to aa 
3230 to 3552 of the polyprotein. The S, M, and N genealogies are based on edited, whole-protein alignments of 840, 162, and 345 aa, respectively. 
Branch confidence values are given as PPs. Congruent trees were obtained by neighbor joining, with critical nodes being supported by bootstrap 
values that are >89%. Asterisks denote the location of midpoint rooting. Group 1 coronaviruses have initial capital letters. Group 2 coronaviruses 
are in lowercase letters. Group 3 coronaviruses are in italics. 

FIG. 2. Split decomposition graph of the SARS coronavirus genome. The PP1ab phylogeny was based on a conserved 322-aa region, 
corresponding to aa 3230 to 3552 of the polyprotein. The S, M, and N genealogies are based on edited, whole-protein alignments of 840, 162, and 
345 aa, respectively. The asterisk on the PP1ab protein denotes the branch that is lost when the whole protein is analyzed. 

Split decomposition analyses of the SARS M and N proteins 
also support descent from an avian-like coronavirus ancestor 
(Fig. 2). The N protein split decomposition graph largely mirrors 
the Bayesian and neighbor-joining trees. The SARS N 
protein diverges from the lineage leading to the avian infectious 
bronchitis coronaviruses. Since there is no reticulation in 
this part of the graph, there is little evidence of recombination 
between these sequences. BS support for the SARS-avian 
clade is marginal at 71.2%. The M protein split decomposition 
graph is more informative. SARS again forms a significant 
clade with the avian coronaviruses (BS, 74.1%), but unlike the 
N protein, the M protein shows reticulations in the SARS- 
avian clade that are clear evidence for recombination between 
these viruses. 

FIG. 3. Bayesian-inferred rooted phylogeny of the PP1ab polyprotein. 
Two conserved regions of the polyprotein, homologous to sequences 
found in the gill-associated okavirus, were concatenated and 
edited to eliminate gaps and ambiguously aligned regions. A phylogeny 
was generated on the resulting 880-aa alignment as described in the 
legend to Fig. 1. The tree was rooted on the gill-associated okavirus, a 
member of the Roniviridae, which is a sister family to the Coronaviridae. 
Branch confidence values are given as PPs. 

The phylogeny of the PP1ab polyprotein is dramatically different 
from that of the M and N proteins. The tree shown in 
Fig. 1B is from a 322-aa region of the PP1ab protein, corresponding 
to aa 3230 to 3552, for which a feline coronavirus 
sequence was available. The branching structure of the tree 
generated from this subsection of the PP1ab protein is identical 
to the structure of the tree generated from the entire 
7,073-aa PP1ab protein. Unlike the previous two proteins, the 
SARS PP1ab protein does not clearly fall into any clade. SARS 
diverges near the center of the unrooted dendrogram shown in 
Fig. 1B. Its position makes phylogenetic clustering problematic, 
since it could cluster significantly (BS, 100%; PP, 100%) 
with either the group 2 (murine and bovine) coronaviruses or, 
just as significantly, with the group 1 (feline, porcine, and 
human) and avian coronaviruses, depending on how the tree is 
rooted. To clarify this issue, we searched GenBank for more 
distantly related sequences that could be used to root this tree. 
By restricting the BLASTP similarity search to noncoronavirus 
sequences, we were able to identify a gill-associated okavirus 
with significant similarity (98.6 bits; E = 6e-18; 37% similarity) 
to the SARS PP1ab protein. There were two conserved 
domains of the okavirus, totaling 880 aa, which showed similarity 
to the SARS PP1ab protein. These regions were concatenated, 
edited, and used to root the PP1ab polyprotein phylogeny. 
Unfortunately, this region does not overlap with any 
region for which the feline infectious peritonitis virus showed 
similarity to the PP1ab sequence data; therefore, the feline 
coronavirus sequence was excluded from the rooted phylogeny. 
The rooted PP1ab phylogeny (Fig. 3) shows strong support for 
monophyly of the SARS virus with the group 2 murine-bovine 
coronaviruses (BS, 83%; PP, 100%), indicating that the SARS 
PP1ab protein originated from a mammalian-like ancestral 
virus. This topology is also supported by midpoint rooting (Fig. 
1B, asterisk). A phylogenetic analysis conducted by Peiris et al. 
(24) on approximately 215 aa of the PP1ab polyprotein of the 
SARS Hong Kong 03/2003 isolate also supports the monophyly 
of SARS with the murine-bovine group. This conclusion is 
consistent with the view of Anand et al. (2), who consider the 
SARS virus a variant of the murine-bovine coronaviruses even 
though the SARS coronavirus lacks the hemagglutinin esterase 
gene that characterizes this group (22). In addition, the core 
transcription-regulating sequence of SARS, which serves to 
regulate the discontinuous transcription of subgenomic RNAs 
(39), has been suggested to be more like those of the murine- 
bovine coronavirus group (22). 

The split decomposition analysis of the SARS PP1ab replicase 
polyprotein also supports a mammalian origin for this 
protein. The split decomposition graph shown in Fig. 2 was 
derived from the same region of SARS-feline similarity shown 
in Fig. 1B. This graph is similar to the Bayesian and neighbor- 
joining trees in showing that SARS is positioned between the 
avian infectious bronchitis coronaviruses and the group 2 (mu- 
rine and bovine) coronaviruses. When the entire 7,073-aa pro- 
tein is analyzed, the branch connecting the SARS and avian 
viruses (Fig. 2, asterisk) is lost, and the group 2-SARS clade is 
supported by a BS score of 100%. 

The genes encoding the mammalian-like PP1ab protein and 
avian-like M and N proteins are separated by the gene encod- 
ing the host-determining S protein. A phylogenetic analysis of 
the S protein places the SARS coronavirus within a clade that 
includes homologous proteins from the avian infectious bron- 
chitis virus and the group 1 coronaviruses, which includes the 
feline infectious peritonitis virus (Fig. 1B). Support for this 
grouping is highly significant (BS, 100%; PP, 100%). A relative 
rate test again showed no rate heterogeneity between major 
coronavirus groups, supporting the use of midpoint rooting. 
The midpoint root of this tree is on the branch leading to the 
group 1 coronaviruses. 

The split decomposition analysis of the S protein again 
strongly supports the Bayesian and neighbor-joining analyses 
(Fig. 2). SARS forms a highly significant clade (BS, 100%) with 
the avian infectious bronchitis coronaviruses and the group 1 
(feline, canine, and porcine) coronaviruses. The highly reticu- 
late nature of this clade supports the occurrence of extensive 
recombination in the evolutionary history of these samples. 
These findings are consistent with the view that the SARS S 
protein is a mosaic of coronaviruses from two distinct groups— 
the avian and group 1 coronaviruses. 

The phylogenetic relationships defined above are not simply 
dependent on the region or domain chosen for phylogenetic 
analysis. Phylogenetic analyses on whole-protein alignments, 
both edited and unedited, yield similar tree topologies for all 
proteins. Focusing on smaller, more conserved regions that 
have adequate phylogenetic signals permitted these phylog- 
enies to be supported with greater confidence. 

As there is clearly phylogenetic incongruence across the 
SARS genome, we sought to identify the precise point of re- 
combination between the mammalian-like and avian-like par- 
ent viruses that gave rise to the present SARS virus. Previous 
studies that used a bootscanning method have found no evi- 
dence of recombination within the SARS genome (28). This 
lack of evidence is surprising since frequent recombination and 
mosaic evolution are more the rule than the exception in RNA 
viral evolution (for a review, see reference 1). Given that the 
two phylogenetically distinct partitions meet at the SARS S 
protein and that this protein is a critical host determinant, we 
focused on the S gene as a candidate locus for recombination. 
We performed a bootscan analysis on a gapless nucleotide 
alignment of the S genes from four taxa with the SARS S 
gene as the query. 

FIG. 4. Bootscan analysis of the S gene to detect recombination. 
Bootscanning was conducted with Simplot on a gapless nucleotide 
alignment, generated with ClustalX, with the SARS S gene as the 
query sequence. The dashed line denotes the feline infectious peritonitis 
virus (X06170), while the solid line denotes the avian infectious 
bronchitis virus (M95169). The greater the percentage of permuted 
trees, the greater the sequence identity to the SARS S sequence. Other 
taxa that were included in the alignment but had less than 10% permuted 
trees were murine hepatitis virus (AF208066) and human coronavirus 
OC43 (L14643). 

Bootscanning revealed one prominent region of the S gene 
where recombination was likely to have taken place. High 
nucleotide identity between the SARS and feline infectious 
peritonitis virus S genes is evident in the first 1,800 nucleotides 
of the alignment (Fig. 4). This finding is particularly intriguing 
given recent reports that the SARS virus has been isolated 
from masked palm civets (7), which are phylogenetically and 
immunologically closely related to cats (32). The sequence 
identity between the SARS and feline coronaviruses is signif- 
icantly reduced between positions 1800 and 2000, where the 
sequence clearly becomes more similar to the avian coronavirus 
sequence. This region roughly corresponds to the region 
between nucleotides 2472 and 2694 of the SARS S gene but 
does not appear to be within the domain previously shown to 
bind the human aminopeptidase N receptor (3, 38). The region 
corresponding to the binding domain of the S protein lies 
between nucleotides 1253 and 1641 of the S gene. However, as 
this binding region is hypervariable among all coronaviruses 
(data not shown), large portions of it were eliminated from the 
alignment when gap-containing columns were removed. De- 
spite this, it has been noted that different regions of the S 
protein may serve as a binding region for different coronavi- 
ruses (22). It is therefore plausible that a recombination event 
in the middle of this gene may have resulted in a new viral 
variant with a modified host range. It should be noted that 
other taxa were included in the bootscan analysis but had 
percentages of permuted trees that were less than 10%, sug- 
gesting low similarity to the SARS coronavirus S gene. As a 
result, these taxa did not register on the bootscan output. 
Alternative alignments of the S gene analyzed by bootscanning 
yielded similar results, with the SARS S gene being a compos- 
ite of predominantly avian and feline coronavirus sequences 
(data not shown). These bootscan results support our phylo- 
genetic analysis of the S protein that places SARS in a mono- 
phyletic group with the avian and feline sequences. 

Conclusion. Despite the fact that the SARS virus does cluster 
with existing coronavirus groups, its genome sequence is 
substantially divergent, and a new taxonomic group has already 
been proposed (22, 24, 28). Irrespective of its taxonomic 
grouping, it is clear that the SARS coronavirus is a mosaic, 
with at least two distinct evolutionary histories. Our analyses 
indicate that the SARS coronavirus is mammalian-like through 
the replicase protein, and avian-like through the M and N 
proteins and that there is a mammalian-avian mosaic in the S 
protein. The gene encoding the S protein lies between the 
replicase gene and the M and N genes. We propose that a 
recombination event likely occurred within the S gene, as dem- 
onstrated by our bootscan analysis. Since the S protein is re- 
sponsible for host specificity, this event may have been the 
critical step in the switch to a human host and the subsequent 
emergence of this new pathogen. The testing of this hypothesis 
will require extensive sampling of coronaviruses from a broad 
range of hosts. Identifying the source of the SARS coronavirus 
will not only lead to better public health practices but will also 
contribute to our understanding of how these important patho- 
gens evolve. 